![]() DEVICE, FM STEREO RADIO RECEIVER, MOBILE COMMUNICATION DEVICE, AND METHOD TO IMPROVE A LEFT/RIGHT OR
专利摘要:
enhancement of an audio signal from an FM stereo radio receiver by using a parametric stereo system. The present invention relates to an apparatus for enhancing a stereo audio signal from an FM stereo radio receiver. the apparatus comprises an estimation stage of parametric stereo parameters (ps). the parameter estimation stage is configured to determine one or more parametric stereo parameters, based on the stereo audio signal, in a frequency-variable or frequency-invariant manner. preferably, these ps parameters are variable with time and frequency. moreover, the apparatus comprises a stage of transformation into surrounding sound. the surround sound transform stage is configured to generate the enhanced stereo signal based on a first audio signal and the one or more parametric stereo parameters. the first audio signal is taken from the stereo audio signal, for example, by a stereo-transform operation in a stereo-transform stage. the parameter estimation stage ps can be part of a ps encoder. the upper mixing stage can be part of a ps decoder. 公开号:BR112012005534B1 申请号:R112012005534-8 申请日:2010-09-07 公开日:2021-08-17 发明作者:Jonas Engdegard;Heiko Purnhagen;Karl Jonas Roeden 申请人:Dolby International Ab; IPC主号:
专利说明:
TECHNICAL FIELD [0001] The present invention relates to audio signal processing, in particular to an apparatus and corresponding method for improving an audio signal of a stereo FM radio receiver. BACKGROUND [0002] In an analog FM (frequency modulation) stereo radio system, the left (L) and right (R) channel of the audio signal are carried in a lateral and intermediate (M/S) representation, this is, as an intermediate channel (M) and a side channel (S). The intermediate channel M corresponds to a sum signal of L and R, eg M = (L + R)/2, and the side channel S corresponds to a difference signal of L and R, eg S = ( L - R)/2. For transmission, the side channel S is modulated onto a 38 kHz suppressed carrier and added to the baseband intermediate M signal to form an inverse compatible stereo multiplex signal. This multiplex signal is then used to modulate the HF (high frequency) carrier of the FM transmitter, typically operating in the range between 87.5 to 108 MHz. [0003] When reception quality decreases (ie, the signal-to-noise ratio over the radio channel decreases), the S channel typically suffers more than the M channel. In many FM receiver implementations, the S channel is muted when reception conditions become too noisy. This means that the receiver switches from stereo to mono in case of a poor HF radio signal. [0004] Parametric Stereo (PS) coding is a technique in the field of very low bit capacity audio coding. PS provides encoding of a 2-channel stereo audio signal as a mono downmix signal in combination with PS side information, ie the PS parameters. The mono downmix signal is obtained as a combination of both channels of the stereo signal. PS parameters allow the PS decoder to reconstruct a stereo signal from the mono downmix signal and the PS side information. Typically, PS parameters vary with time and frequency, and PS processing in the PS decoder is typically conducted in a hybrid filterbank domain incorporating a QMF bank. The document "Low Complexity Parametric Stereo Coding in MPEG-4", Heiko Purnhagen, Proc. Digital Audio Effects Workshop (DAFx), p. 163 - 168, Naples, IT, October 2002, describes an exemplary PS encoding system for MPEG-4. His discussion of the parametric stereo system is incorporated by reference into this descriptive report. Parametric stereo system is supported, for example, by MPEG-4 Audio. Parametric stereo system is discussed in section 8.6.4 and in annexes 8.A and 8.C of the MPEG-4 ISO/IEC 14496-3:2005 standardization document ("MPEG-4 Audio", 3rd edition). These parts of the standardization document are incorporated into this descriptive report by reference for all purposes. Parametric stereo system is also used in the MPEG Surround standard (see ISO/IEX 230031:2007, MPEG Surround). Also, this document is incorporated by reference into this descriptive report for all purposes. Other examples of parametric stereo coding systems are discussed in "Binaural Cue Coding - Part I: Psycoacoustic Fundamentals and Design Principles", Frank Baumgarte and Christof Faller, IEEE Transactions on Speech and Audio Processing, vol. 11, no. 6, pages 509 - 519, November 2003, and in "Binaural Cue Coding - Part II: Schemes and Applications", Christof Faller and Christof Faller, IEEE Transactions on Speech and Audio Processing, vol. 11, no. 6, pages 520 - 531, November 2003 In these two documents, the term "binaural indication coding", which is an example of parametric stereo coding. " which are derived, for example, according to L = M + S and R = M - S). When an S-side signal has only poor to intermediate quality, there are two options: either the receiver chooses to accept the noise associated with the S-side signal and transmit the real stereo, or the receiver downloads the S-side signal and puts it in mono. Poor to intermediate quality, there are two options: either the receiver chooses to accept the noise associated with the S-side signal and transmit the real stereo, or the receiver downloads the S-side signal and puts it in mono. [0006] The international patent application WO 2008/032255 describes a decoder, which is arranged to modify the sweet spot of an audio signal of M spatial channels by modifying spatial parameters. Specifically, a receiver receives an N-channel audio signal, where N < M. The M-channel signal can be an MPEG Surround sound signal and the N-channel signal can be a stereo signal. A parametric unit determines the spatial parameters relating to the N-channel audio signal to the M-channel audio signal, and a modification unit modifies the sweet spot of the M-channel audio signal by modifying at least one of the parameters space. A generator unit then generates the M spatial channel audio signal by upmixing the N channel audio signal using the at least one modified spatial parameter. SUMMARY OF THE INVENTION [0007] A first aspect of the invention relates to an apparatus for enhancing an audio signal from a stereo FM radio receiver. The device generates a stereo audio signal. The audio signal to be enhanced may be an audio signal in L/R representation, i.e., an L/R audio signal, or in an alternative embodiment an audio signal in M/S representation, i.e., a signal M/S audio Typically, the audio signal to be enhanced is an audio signal in L/R representation, since conventional FM radio receivers use an L/R output. [0008] As an exemplary embodiment of the present invention, the apparatus is for an FM stereo radio receiver, configured to receive an FM radio signal, comprising an intermediate signal and a side signal. [0009] The apparatus comprises a parametric stereo parameter estimation stage. The parameter estimation stage is configured to determine one or more PS parameters based on the L/R or M/S audio signal and a way of varying with frequency or not varying with frequency. The one or more parameters can include a parameter indicative of the intensity differences between the channels (IID or also called CLD - channel level differences) and/or a parameter indicative of a cross correlation between the channels (ICC). Preferably, these PS parameters are time- and frequency-varying. [00010] Furthermore, the apparatus comprises an upmix stage. The upmix stage is configured to generate the stereo signal based on a first audio signal and one or more PS parameters. [00011] The first audio signal is taken from the L/R or M/S audio signal, for example, by a downmix operation in a downmix stage. The first audio signal can be obtained from the audio signal in the case of an L/R representation by a downmix operation according to the following formula: DM = (L + R)/a, with DM corresponding to the first audio signal . For example, parameter a is selected to be 2. In the case of DM = (L + R)/a, the first audio signal essentially corresponds to the intermediate received signal M. In advanced adaptive downmix schemes, the two parameters a1 , a2 to combine the two channels according to the formula DM = L/a1 + R/a2 may be different, and/or may depend on PS parameters and/or other signal properties. [00012] In case of an M/S representation, at the output of the FM stereo radio receiver, the first audio signal can simply correspond to the M signal of the M/S audio signal at the output. [00013] The PS parameter estimation stage can be a part of a PS encoder. The upmix stage can be part of a PS decoder. [00014] The apparatus is based on the idea that, due to its noise, the received side signal may not be good enough to reconstruct the stereo signal by simple combination of the intermediate and received side signals; nevertheless, in this case the side signal, or the side signal component in the L/R signal, may still be good enough for stereo parameter analysis in the PS parameter estimation stage. These PS parameters can then be used to reconstruct the stereo signal. [00015] Thus, the device allows an improved stereo reception, under conditions of intermediate noise or even large on the side signal. It should be noted that the term "noise" is usually used to refer to noise introduced from the limitations of the radio transmission channel (as opposed to the signal component in the form of noise in the actual audio signal being broadcast) . [00016] Instead of using a noisy received side signal to create the stereo audio signal, an enhanced side signal generated at the receiver can be used. The enhanced side signal can be generated with the help of PS encoding techniques. These include, for example, the generation of enhanced side signal components from a decorrelator, operating on the first audio signal as an input. Data on reception conditions and/or an analysis of the received stereo signal can be used to adaptively control the generation of the enhanced side signal as well as the generation of the audio output signals. [00017] According to another embodiment, the apparatus further comprises a de-correlated, configured to generate a de-correlated signal based on the first audio signal. The upmix stage can generate the stereo signal based on the first audio signal, the one or more PS parameters and the de-correlated signal, or at least the frequency band of the de-correlated signal. [00018] Instead of using the uncorrelated signal, the upmix stage can use the received side signal for the upmix, for example, in case of good reception conditions, when the noise of the received side signal is low. Therefore, according to an embodiment, for upmix, selectively, the received side signal or the decorrelated signal is used. Particularly, the section is variable with frequency. For example, the upmix stage can use the received side signal for lower frequencies, and can use the uncorrelated signal as a pseudo side signal for higher frequencies, since the higher the frequency, the greater the noise density. This is a typical property of FM demodulation in case of additive (white) noise in the radio channel. This will be explained in detail below in the descriptive report. [00019] The received side signal or at least one or more of its frequency components can be used for upmix, if the first signal matches the intermediate signal. In case of a different downmix scheme (which is different from (L + R)/a, to generate the first audio signal), a residual signal can be used for the upmix, instead of using the received side signal. This residual signal indicates the error associated with representing the original channels by their downmix and PS parameters, and is often used in PS encoding schemes. The above remarks for the use of the received side signal also apply to a residual signal. [00020] The selection between the received side signal and the uncorrelated signal, for upmix, can be signal dependent or, in other words, signal adaptive. [00021] According to yet another embodiment, the selection depends on the reception conditions indicated by a radio reception indicator, such as the signal strength and/or an indicator indicative of the quality of the received side signal. In case of good reception conditions (ie high intensity), the received side signal can preferably be used for upmix (in some cases not for the higher frequencies), whereas in case of reception conditions intermediate (ie, a lower intensity), the uncorrelated signal can be used in the upmix. [00022] In very poor reception conditions, with high noise levels in the side signal, the FM receiver can switch to a mono output mode to decrease the noise of the audio signal. In case of a stereo L/R audio signal, at the FM receiver output, both channels at the output have the same signal in mono playback. In case of an M/S stereo signal, at the FM receiver output, the S channel at the output is muted. In mono output mode, stereo information is missing from the audio signal from the FM receiver. Therefore, the PS parameter estimation stage cannot determine the proper PS parameters to create a real stereo signal in the upmix stage. Even if the FM receiver does not switch to mono output mode under very poor reception conditions, the audio signal at the FM receiver output may be too bad to estimate the significant PS parameters. [00023] The apparatus can be configured to detect if the noise has selected the mono output of the stereo radio signal, and/or it can be configured to detect these poor reception conditions (which are too poor for estimating significant PS parameters). In case of mono output detection, or in case of detection of these poor reception conditions, the upmix stage can generate a stereo pseudo signal. The upmix stage uses one or more upmix parameters to blind the upmix instead of the estimated parameters as discussed above. This mode is referred to as a pseudo stereo operation or a blind upmix operation. [00024] The blind upmix operation specifies, in this case, that after detecting poor reception conditions or detecting the mono output and thereby starting the blind upmix operation, spatial acoustic information - if at all present - in the signal FM receiver output switches are not used to determine the upmix parameters, and thus are not considered for the upmix (if there is already a mono output on the FM receiver output, no spatial acoustic information will be present and, therefore, they cannot be considered at all). Compared to the operating mode discussed above, in which the PS parameters are determined to reconstruct the side signal into the upmix stage output signal, in blind upmix operation, the apparatus does not seek to reconstruct the side signal into the stage output signal. of upmix. [00025] However, blind upmix does not mean that the device is "blind", since the upmix parameters are necessarily independent of the FM receiver output signal. For example, the FM receiver output signal can be monitored whether it is music or speech, and as a result, upmix parameters can be selected. [00026] One embodiment for blind upmix is to use preset upmix parameters. Preset upmix parameters can be default or stored upmix parameters. [00027] Notwithstanding, the upmix parameters used can be signal dependent, for example the upmix parameters for speech and the upmix parameters for music. In this case, the device also has a speech detector (for example, a speech/music discriminator), which detects whether the audio signal is predominantly speech or music. For example, in the case of pure music, the upmix parameters can be selected so that the downmix signal and its uncorrelated versions are mixed, while in the case of pure speech, the upmix parameters can be selected so that the uncorrelated version of the downmix signal is not used, and only the downmix signal is used to upmix to a "mono" left/right signal. In the case of an audio signal being a mixture of speech and music, blind upmix parameters can be used, which are in the range between the upmix parameters for pure speech and the upmix parameters for pure music. Interpolated upmix parameters for all states in that range can also be used. [00028] Advanced blind upmix schemes for pseudostereo can be considered, in which an even more advanced analysis of the mono signal is conducted, and this is used as the basis for deriving the "artificially generated" or "synthetic" PS parameters. [00029] For a side signal with practically only noise, the apparatus preferably switches to a pseudostereo mode, as discussed above. As mentioned above, the term "noise" refers, in this specification, to the noise introduced by poor radio reception (ie, the low signal-to-noise ratio in the radio channel), not to the noise contained in the original sent signal. for the FM broadcast transmitter. [00030] However, for an almost noise free side signal, i.e. almost no noise originating from the FM radio transmission, the device preferably switches to the normal stereo mode instead of the parametric stereo mode. In normal stereo mode, the device's signal enhancement feature is essentially disabled. For deactivation, the left/right audio signal at the input of the device can essentially be fed from the output of the device. [00031] Alternatively, for deactivation, only the received side signal (not the decorrelated signal) is mixed with the first audio signal, in the upmix stage. When properly selecting the upmix parameters in the upmix stage, the output signal from the upmix stage corresponds to the output signal from the FM transmitter; for example, when mixing the first DM audio signal and the received side signal So, according to: L' = DM + So and R' = DM - So, in the case where DM = (L + R)/2 and So = (L - R)/2. [00032] Particularly, in some cases, the normal stereo mode from the parametric stereo mode can be selected in a frequency-varying way, ie the selection can be different for different frequency bands. This is useful as the signal-to-noise ratio for the received side signal is characteristically worse at higher frequencies. As discussed above, this is a typical property of FM demodulation. [00033] Other embodiments of the apparatus are discussed in the dependent embodiments. [00034] A second aspect of the invention relates to an apparatus for generating a stereo signal based on the left/right or intermediate/side audio signal of an FM stereo radio receiver. The player is set to detect that the FM stereo receiver has selected mono output from the stereo radio signal, or the player is set to detect poor radio reception. The apparatus comprises a stereo upmix stage. The upmix stage is configured to generate the stereo signal based on a first audio signal and one or more upmix parameters for blind upmix, in case the device detects that the FM stereo receiver has selected a mono output of the stereo radio signal, or the device detects poor reception. The first audio signal is taken from the left/right or middle/side audio signal. [00035] Upmix parameters for blind upmix can be preset parameters such as default or stored parameters. [00036] The device provides the generation of a pseudo stereo signal, having a low noise level, in case of very bad reception conditions, with high noise levels on the signal side. Under these receiving conditions, the FM receiver may switch to mono mode to decrease the noise of the audio signal, or the L/R or M/S audio signal may be too bad for estimating meaningful PS parameters. This is detected and then blind upmix of upmix parameters are used to generate a stereo pseudo signal. This has already been discussed in conjunction with the first aspect of the invention. [00037] As also discussed in conjunction with the first aspect of the invention, the apparatus may comprise a detection stage for detecting whether the FM stereo receiver has selected mono output from the stereo radio signal. [00038] According to an exemplary embodiment, the apparatus further comprises an audio type detector, such as a speech detector, indicating whether the audio signal, at the output of the FM transmitter, is predominantly speech or not. In this case, the upmix parameters are dependent on the speech detector indication. For example, the apparatus uses upmix parameters in the case of speech and different upmix parameters in the case of music, as discussed in detail in conjunction with the first aspect of the invention. [00039] The apparatus according to the second aspect of the invention may further include the features of the apparatus according to the first aspect of the invention, and vice versa. [00040] A third aspect of the invention relates to an FM stereo radio receiver, configured to receive an FM radio signal, comprising an intermediate signal and a side signal. The stereo radio receiver includes an apparatus for enhancing the audio signal in accordance with the first and second aspects of the invention. [00041] A fourth aspect of the invention relates to a mobile communication device, such as a cell phone. The mobile communication device comprises an FM stereo receiver configured to receive an FM radio signal. Furthermore, the mobile communication device comprises an apparatus for improving the audio signal, in accordance with the first and second aspects of the invention. [00042] A fifth aspect of the invention relates to a method for enhancing a left/right or intermediate/side audio signal from an FM stereo radio receiver. The characteristics of the method according to the fifth aspect correspond to the characteristics of the apparatus according to the first aspect. One or more PS parameters are determined based on the left/right or intermediate/side audio signal in a frequency-variant or frequency-invariant manner. The stereo signal is generated based on said first audio signal and one or more PS parameters by an upmix operation. [00043] The remarks for the first aspect of the invention also apply to the fifth aspect of the invention. [00044] A sixth aspect of the invention relates to a method for generating a stereo signal based on the left/right or middle/side audio signal of an FM stereo radio receiver. The characteristics of the method according to the sixth aspect correspond to the characteristics of the apparatus according to the second aspect. It is noted that the FM stereo receiver has selected the mono output of the stereo radio signal, or, in an alternative embodiment, lower radio reception is observed. In case the FM stereo receiver has selected the mono output of the stereo radio signal, or in the case of poor radio reception, the stereo signal is generated based on a first audio signal and one or more upmix parameters for blind upmix , such as the preset upmix parameters. [00045] The considerations for the second aspect of the invention also apply to the sixth aspect of the invention. DESCRIPTION OF DRAWINGS [00046] The invention is explained below by means of illustrative examples, with reference to the attached drawings, in which: Figure 1 illustrates a schematic embodiment for improving the stereo output of a stereo FM radio receiver; Figure 2 illustrates an embodiment of the audio processing apparatus, based on the parametric stereo concept; Figure 3 illustrates another embodiment of PS-based audio processing apparatus having a PS encoder and a PS decoder; Figure 4 illustrates an extended version of the audio processing apparatus of Figure 3; Figure 5 illustrates an embodiment of the PS encoder and PS decoder of Figure 4; Figure 6 illustrates an exemplary structure of the S signal used for upmix; Figure 7 illustrates an extended version of the audio processing apparatus of Figure 3, to which a noise reduction algorithm is added; Figure 8 illustrates another embodiment of audio processing apparatus, with noise reduction, for estimating PS parameters; Figure 9 illustrates another embodiment of the audio processing apparatus for pseudostereo generation, in the case of only mono output from the FM receiver; Figure 10 illustrates the occurrence of short stereo output dropout periods, in stereo reproduction, at the FM receiver output; Figure 11 illustrates an error-compensated PS parameter estimation stage; and Figure 12 illustrates another embodiment of the audio processing apparatus, based on the HE-AAC v2 encoder. DETAILED DESCRIPTION [00047] Figure 1 shows a simplified schematic embodiment for improving the stereo output of an FM stereo radio receiver 1. As discussed in the background section, in FM radio, the stereo signal is transmitted by design as an intermediate signal and a side sign. On FM receiver 1, the side signal is used to create the stereo difference between left channel L and right channel R at the output of FM receiver 1 (at least when reception is good enough and side signal information is not are muted). Left and right L, /R channels can be digital or analog signals. To enhance the L, R audio signals from the FM receiver, an audio processing device 2 is used, which generates an L' and R' stereo audio signal at the output. The audio processing apparatus 2 corresponds to a system, which is enabled to conduct noise reduction of a received FM radio signal, using a parametric stereo system. The audio processing in apparatus 2 is preferably conducted in the digital domain; therefore, in the case of an analog interface, between FM receiver 1 and audio processing device 2, an analog-to-digital converter is used, before digital audio processing in device 2. FM receiver 1 and the audio processing apparatus 2 can be integrated in the same semiconductor integrated circuit, or can be part of two semiconductor integrated circuits. The FM receiver 1 and the audio processing apparatus 2 can be part of a wireless communication device, such as a cell phone, a personal digital assistant (PDA) or a smart phone. In that case, the FM receiver 1 can be part of the baseband integrated circuit, having an FM radio receiver functionality. [00048] Instead of using a left/right representation at FM receiver output 1 and device 2 input, an intermediate/side representation can be used at the interface between FM receiver 1 and device 2 (see M, S in figure 1 for the intermediate/lateral representation and L, R for the left/right representation). This intermediate/side representation, at the interface between FM receiver 1 and device 2, can result in less effort, since FM receiver 1 already receives an intermediate/side signal, and audio processing device 2 can directly process the intermediate/side signal without downmixing. The intermediate/side representation can be advantageous if the FM receiver 1 is tightly integrated with the audio processing apparatus 2, in particular if the FM receiver 1 and the audio processing apparatus 2 are integrated on the same integrated circuit semiconductor. [00049] Optionally, a signal of signal strength 6, indicative of the radio reception condition, can be used to adapt the audio processing in the audio processing apparatus 2. This will be explained later in this descriptive report. [00050] The combination of FM radio receiver 1 and audio processing apparatus 2 corresponds to an FM radio receiver having an integrated noise reduction system. [00051] Figure 2 shows an embodiment of the audio processing apparatus 2, which is based on the concept of parametric stereo system. Apparatus 2 comprises a PS parameter estimation stage 3. Parameter estimation stage 3 is configured to determine PS parameters 5 based on the input audio signal to be enhanced (which can be a left/right or middle representation /side). The PS 5 parameters may include, among others, a parameter indicative of the differences in intensity between the channels (IID or also the so-called CLD - channel level differences) and/or a parameter indicative of a cross correlation between the channels (ICC) . Preferably, the PS 5 parameters are time- and frequency-varying. In the case of an M/S representation, at the input of parameter estimation stage 3, parameter estimation stage 3 can nevertheless determine the PS 5 parameters which refer to the L/R channels. [00052] A DM audio signal is taken from the input signal. In the case where the input audio signal already uses an intermediate/side representation, the DM audio signal can directly correspond to the intermediate signal. In the case where the input audio signal has a left/right representation, the audio signal is generated by downmixing the audio signal. Preferably, the resulting signal DM, after downmix, corresponds to the intermediate signal M and can be generated by the following equation: DM = (L + R)/a, for example, with a = 2, i.e. the downmix signal DM it can correspond to the average of the L and R signals. For different values of a, the average of the L and R signals is amplified or attenuated. [00053] The apparatus also comprises an upmix stage 4, also called stereo mixing stage or upmixer transformer. Upmix stage 4 is configured to generate a L', R' stereo signal based on the DM audio signal and PS 5 parameters. Preferably, upmix stage 4 not only uses the DM signal, but also uses a signal lateral, or some type of lateral pseudosignal (not shown). This will be explained below in the descriptive report together with further embodiments extended in figures 4 and 5. [00054] The apparatus 2 is based on the idea that, due to its noise, the received side signal can be too noisy to reconstruct the stereo signal, by simple combination of the received intermediate and side signals; nevertheless, in this case, the side signal, or the side signal component in the L/R signal, may still be good enough for stereo parameter analysis in the PS parameter estimation stage 3. These resulting PS parameters 5 can then be used to generate a L', R' stereo signal having a reduced noise level compared to the audio signal directly at the noise output 1. [00055] In this way, a bad FM radio signal can be "cleaned up" by using the parametric stereo concept. Most of the distortion and noise in an FM radio signal is located in the side signal, which may not be used in PS downmix. Nevertheless, the side channel is, even in the case of poor reception, often of sufficient quality for extracting PS parameters. [00056] In all the drawings presented below, the input signal to the audio processing apparatus 2 is a left/right stereo signal. With minor modifications in some modules within the audio processing apparatus 2, the audio processing apparatus 2 can also process an input signal in the intermediate/side representation. Therefore, the concepts discussed in this descriptive report can be used in conjunction with an input signal in the middle/side representation. [00057] Figure 3 shows an embodiment of the PS 2 based audio processing apparatus, which makes use of a PS 7 encoder and a PS 8 decoder. The parameter estimation stage 3 is, in this example, part of the encoder PS 7, and upmix stage 4 is part of the PS 8 decoder. The terms "PS encoder" and "PS decoder" are used as names to describe the function of the audio processing blocks inside player 2. It should be noted that all audio processing takes place in the FM receiver device. These PS encoding and PS decoding processes can be tightly coupled, and the terms "PS encoding" and "PS decoding" are only used to describe the legacy of audio processing functions. [00058] The PS 7 encoder generates - based on the stereo audio input signal L, R - the DM audio signal and the PS 5 parameters. Optionally, the PS 7 encoder still uses a signal of signal strength 6. O DM audio signal is a mono downmix and preferably corresponds to the received intermediate signal. When summing the L/R channels to form the DM signal, the received side channel information can be completely excluded in the DM signal. Thus, in this case only the intermediate information is contained in the mono DM downmix. Therefore, any side channel noise can be excluded in the DM signal. However, the side channel is part of the analysis of stereo parameters in encoder 7, as encoder 7 typically considers L = M + S and R = M - S as input (consequently, DM = (L + R)/2 = M). [00059] Experimental results indicate that a received side signal, which contains intermediate noise levels, may not be good enough for the reconstruction of the stereo itself, but it may be good enough for the analysis of stereo parameters in a PS 7 encoder. [00060] The DM mono signal and PS 5 parameters are subsequently used in decoder 8 to reconstruct the stereo signal L', R'. [00061] Figure 4 shows an extended version of the audio processing apparatus 2 of figure 3. In this case, in addition to the DM mono downmix signal and the PS parameters, also the originally received side signal So is passed to the decoder 8. This approach is similar to the "residual coding" techniques of PS coding, and makes use of at least parts (eg, certain frequency bands) of the received side signal So, in the case of good but not perfect, reception conditions. The received side signal So is preferably used in the case where the mono downmix signal does not correspond to the intermediate signal. i, more generic residual signal can be used in place of the received side signal So. This residual signal indicates the error associated with representing the original channels by their downmix and PS parameters, and is often used in PS encoding schemes. In the following, considerations for using the received side signal So also apply to a residual signal. [00062] The use of a residual signal in a PS encoder/decoder is, for example, described in the MPEG Surround standard (see document ISO/IEC 23003-1:2007, MPEG Surrond) and in the article " MPEG Surrond - The ISO/ MPEG Standard for Efficient and Compatible Multi-Channel Audio Coding", J. Herre et al., Audio Engineering Convention Paper 7084, 122nd Convention, May 5-8, 2007. [00063] Figure 5 shows an embodiment of the PS encoder 7 and the PS decoder 8 of figure 4. The PS encoder module 7 comprises a downmix generator 9 and a PS parameter estimation stage 3. For example, the downmix generator 9 can create a mono DM downmix, which preferably corresponds to an intermediate signal M (eg DM = M = (L + R)/a), and can also optionally generate a second signal, which corresponds to the received side signal So = (L - R)/a. [00064] PS parameter estimation stage 3 can estimate as PS 5 parameters the correlation and level difference between L and R inputs. Optionally, parameter estimation stage receives signal strength 6, which can be the signal strength on the FM receiver. This information can be used to decide on reliability, for example, in case of low signal strength 6, the PS 5 parameters. In the case of low reliability, the PS 5 parameters can be adjusted so that the output signal L', R' is either a mono output signal or a stereo output pseudo signal. In the case of a mono output signal, the output signal L' is equal to the output signal R'. In the case of a stereo output pseudo signal, the standard PS parameters can be used to generate either a stereo output pseudo signal or a standard L', R' stereo output signal. [00065] The decoder module 8 comprises a stereo mix matrix 4a and a decorrelator 10. The decorrelator receives the mono DM downmix and generates a decorrelated signal S', which is used as a pseudo side signal. Decorrelator 10 can be performed by a suitable full pass filter, as discussed in section 4 of the cited document "Low Complexity Parametric Stereo Coding in MPEG-4". The stereo mix matrix 4a is, in this embodiment, a 2 x 2 upmix matrix. [00066] Depending on the estimated parameters 5, matrix 4a mixes the DM signal with the received side signal So, or the decorrelated signal S', to create the stereo output signals L' and R'. The selection between signal So and signal S' may depend on a radio reception indicator indicative of reception conditions, such as signal strength 6. A quality indicator may be used in place or in addition to it, indicative of the quality of the received side signal. An example of this quality indicator might be an estimated noise (power) of the received side signal. In the case of a side signal comprising a high degree of noise, the uncorrelated signal S' can be used to create the stereo output signal L' and R', while in low noise situations, the side signal So can be used. Various embodiments for estimating received side signal noise are discussed below in this descriptive report. [00067] As an example, in case of good reception conditions (ie signal strength is high), signal So is used for upmix, while in case of bad conditions, upmix is based on uncorrelated signal S' . Preferably, the decision whether the stereo mix module 4 uses the received side signal So or S' is frequency dependent, eg for lower frequencies the received side signal So is used, and for higher frequencies the signal uncorrelated S' is used. This will be discussed in detail in conjunction with Figure 6. [00068] Frequency-variant or frequency-invariant selection between signal So and signal S' can be made in upmix stage 4 (for example, by the selector means in upmix stage 6, which is controlled, for example, depending on the signal strength 6). Alternatively, frequency-variant or frequency-invariant selection can be conducted in parameter estimation stage 3 (for example, in dependence on signal strength 6), and parameter estimation stage 3 then sends the upmix parameters for upmix stage 6, which cause, respectively, the selected signal (So or S') is used for the upmix, for example, the upmix parameters relative to the So signal are set to zero, and the parameters relative to S'are not set to zero in the case of S' selection. Alternatively, a select signal (not shown) can be sent to upmix stage 6. [00069] The upmix operation is preferably conducted according to the following matrix equation: . [00070] In this case, the weight factors α, β, y, δ determine the weight of the DM and S signals. The mono downmix preferably corresponds to the received intermediate signal. The signal S in the formula corresponds to the decorrelated signal S' or the received side signal So. The elements of the upmix matrix, ie the weight factors α, β, y, δ can be derived, for example, as shown in the cited article "Low Complexity Parametric Stereo Coding in MPEG-4" (see section 2.2), as shown in the cited MPEG-4 standardization document ISO/IEC 14496-3:2005 (see section 8.6.4.6.2), or as shown in the ISO/IEC 23003-1 specification document (see section 6.5.3.2) . These sections of the documents (and also the sections referred to in those sections) are incorporated by reference into this specification for all purposes. [00071] Preferably, the selection between S' and So is often dependent. This is shown in Figure 6 indicating an exemplary structure of the S signal used for upmix. As indicated in figure 6, for lower frequencies, the received side signal So is used for upmix, and, for higher frequencies, the decorrelated signal S' is used for upmix. [00072] If the received side signal So corresponds to So = (L - R)/2 and L' = M + So and R' = M - So, the mono DM downmix should preferably correspond to (L + R )/two; this allows for perfect reconstruction, that is, L' = L and R' = R. [00073] Instead of using a PS upmixer, using the received side signal So, a generic PS upmixer, using a residual signal, can be used. The resulting signals L', R' are functions of the PS, residual signal and mono downmix parameters. [00074] Figure 7 shows an exemplary embodiment using noise reduction. As in figure 5, in figure 7, the signal So is optional. In the case of having an So signal, a common noise reduction algorithm can be used, which leads to noise reduction of the DM and So signals. Alternatively, two differentially configured noise reduction modules can be used, one for DM signal noise reduction and one for So signal noise reduction. It is also possible that only one signal can be subjected to noise reduction (eg the DM signal or the So signal). In figure 7, the noise reduction stage 11 conducts the noise reduction of the DM signal, and the noise reduction signal DM', after noise reduction, is fed to the PS decoder 8 and its internal upmix stage 4. noise reduction stage 11 conducts noise reduction of the signal So, and the reduced noise signal So', after noise reduction, is fed to the PS 8 decoder. [00075] Figure 8 shows another embodiment of apparatus 2. In this case, a noise reduction method 12 is applied to the stereo input signal, the resulting reduced noise signal R', L' is then analyzed by the estimation stage of PS 3 parameters of the PS 8 encoder. Noise reduction can be very aggressive and optimized for PS parameter extraction, as the DM downmix signal takes another path, which does not include the noise reduction stage 12. [00076] The DM mono downmix signal can be generated by adding the L, R channels with the same weight factors (eg using weight factors of 1 or using weight factors of 1/2). The DM signal then corresponds to the intermediate signal received. When using weight factors of 1/2, the amplitude of the DM signal is half the amplitude of the DM signal, in the case when using weight factors of 1. [00077] Optionally, some form of noise reduction can also be applied to the L/R signal or DM signal (and/or the So signal, if used). For example, some noise reduction can be applied to the DM signal (see optional noise reduction stage 11 in Figure 8). Preferably, this noise reduction stage is smoother than the aggressive noise reduction stage 12. The noise reduction stage 11 can alternatively be placed upstream of the downmix stage 9 (eg at the input of device 2 , or directly before the downmix stage 9). [00078] Under certain reception conditions, FM receiver 1 only provides a mono signal, with the side signal carried being muted. This will typically happen when reception conditions are very poor and the side signal is very noisy. In the case where the FM stereo receiver 1 has switched to mono reproduction of the stereo radio signal, the upmix stage preferably uses upmix parameters to blind the upmix, such as preset upmix parameters, and generates a stereo pseudo-signal, ie the upmix stage generates a stereo signal using the upmix parameters to blind the upmix. [00079] There are also embodiments of the FM stereo receiver 1 that switch, under poor reception conditions, to mono reproduction. If reception conditions are too poor to estimate reliable PS parameters 5, the upmix stage preferably uses upmix parameters to blind the upmix and generates a pseudo stereo signal based on this. [00080] Figure 9 shows an embodiment for stereo pseudo-generation in the case of only mono output from FM receiver 1. In this case, a mono/stereo detector 13 is used to detect whether the input signal to device 2 is mono, that is, if the L and R channel signals are equal. In the case of mono reproduction from FM receiver 1, the mono/stereo detector 13 indicates upmix to stereo, using, for example, a PS decoder with fixed upmix parameters. In other words: in this case, upmix stage 4 does not use the PS parameters of the PS parameter estimation stage 3 (not shown in figure 9), but uses the fixed upmix parameters (not shown in figure 9). [00081] Optionally, a speech detector 14 can be added to indicate whether the received signal is predominantly speech or music. This speech detector 14 provides a blind upmix dependent signal. For example, this speech detector 14 can allow a signal dependent on upmix parameters. Preferably, one or more upmix parameters can be used for speech and one or more different upmix parameters can be used for music. This speech detector 14 can be run by a Voice Activity Detector (VAD). Strictly speaking, the upmix stage 4 in Figure 9 comprises a decorrelator 10, a 2 x 2 upmix matrix, and a means for converting the output of the mono/stereo detector 13 and the speech detector 14 into some form of PS parameters, used as input to effective stereo upmix. [00082] Figure 10 illustrates a common problem when the audio signal provided by noise 1 switches between stereo and mono, due to bad reception conditions varying with time (for example, fading). To maintain a stereo sound image during mono/stereo switching, error concealment techniques can be used. The time intervals at which concealment should be applied are indicated by "C" in Figure 10. One approach to concealment in PS encoding is to use upmix parameters that are based on previously estimated PS parameters, in which case new PS parameters cannot be computed because the audio output of FM 1 receiver has dropped to mono. For example, upmix stage 4 may continue to use previously estimated PS parameters, in the case that new PS parameters cannot be computed because the audio output of FM receiver 1 has dropped to mono. In this way, FM stereo receiver 1 switches to mono audio output, stereo upmix stage 4 continues to use the previously estimated PS parameters from PS parameter estimation stage 3. If the drop-off periods at the stereo output are short enough, so that the stereo sound image of the FM radio signal remains similar during the dip period, the dip is not audible, or just barely audible, at the audio output of device 2. Another approach might be to interpolate and/ or extrapolate the upmix parameters from previously estimated parameters. With respect to determining upmix parameters based on previously estimated PS parameters, one can, in light of the teachings of the present invention, also use other known technologies, for example, error concealment mechanisms, which can be used in decoders to mitigate the effect of transmission errors (eg corrupted or missing data). [00083] The same approach of using upmix parameters, based on the previously estimated PS parameters, can also be applied if the FM receiver 1 provides a noisy stereo signal, for a short period of time, with the noisy stereo signal being too bad to estimate reliable PS parameters based on it. [00084] Next, an advanced 3' PS parameter estimation stage, which provides error compensation, is discussed with reference to figure 11. In the case of PS parameter estimation based on a stereo signal, containing a noisy side component , there may be an error in the calculation of PS parameters if conventional formulas are used to determine the PS parameters, such as for determining the CLD (Channel Level Differences) parameter and the ICC (Cross Correlation Between Channels) parameter. [00085] When considering that the noise in the side signal is independent of the intermediate signal: - the ICC values are closer to 0, compared to the estimated ICC values based on a noisy stereo signal; and - CLD values, in decibels, are closer to 0 dB compared to estimated CLD values based on a noisy stereo signal. [00086] For error compensation in the PS parameters, the apparatus 2 preferably has a noise estimation stage, which is configured to determine a noise parameter characteristic for the noise power of the received side signal, which was caused by (bad) radio transmission. The noise parameter is considered when estimating the PS parameters. This can be implemented as shown in figure 11. [00087] According to Fig. 11, the signal strength data 6 can be used to compensate, at least partially, for the error. Signal strength 6 is often available on FM radio receivers. The signal strength 6 is input to the parameter analysis stage 3 in the PS encoder 7. At a signal noise power estimation stage 15, the signal strength value 6 can be converted into a noise power estimate of sign N2, with N2 = E(n2), where "E()" is the expectation operator. As an alternative to signal strength 6, audio signal L, R can be used for estimating audio signal strength, as will be discussed below. [00088] The values of the effective noisy stereo input signals lw/noise and rw/noise, which are input in the 3' internal parameter estimation stage, shown in figure 11, can be expressed in dependence on the respective lw/noise and rw/noise without noise, and the noise values of the received side signal values: [00089] It should be noted that in the present invention the received side signal is modeled as s + n, where "s" is the original side signal (undistorted), and "n" is the noise (distortion signal) caused by the radio broadcast channel. Furthermore, it is considered that, in the present invention, the m signal is not distorted from the radio transmission channel. [00090] Thus, the corresponding input powers Lw/noise2, Rw/noise2 and the cross correlation Lw/noise Rw/noise can be written as: with the r power estimate of the side signal N2, with N2 = E(n2), where "E()" is the expectation operator. [00091] By rearranging the equations mentioned above, the compensated powers and the corresponding noise-free cross-correlation can be determined to be: [00092] An extraction of error compensated PS parameters, based on compensated powers and cross-correlation, can be conducted as shown by the following formulas: [00093] This parameter extraction offsets the estimated N2 term in the calculation of the PS parameters. [00094] In Figure 11, the signal noise power estimation stage 15 is configured to derive the noise power estimate N2 based on the signal strength 6 and/or the audio input signals (L and R) . The N2 noise power estimate can be either frequency-varying or time-varying. [00095] Several methods can be used to determine the noise power of the side signal N2, for example: - when detecting the minimum power of the intermediate signal (eg pauses in speech), it can be considered that the signal strength lateral is noise only (ie, the lateral signal strength corresponds to N2 in these situations); - the estimate of N2 can be defined by a function of the signal strength data 6; the function (or check table) can be elaborated by experimental (physical) measurements; - the estimate N2 can be defined by a function of the signal strength data 6 and/or audio input signals (L and R); the function can be elaborated by heuristic rules; and - the N2 estimate can be based on the study of the coherence of the signal type of the intermediate and lateral signals; the original mid and side signals can be, for example, considered to have a similar tone to noise ratio or crest factor, or other similar power envelope characteristics. Deviations from these properties can be used to indicate a high level of N2. [00096] In the following, other preferred embodiments of the audio processing apparatus 2 are discussed. [00097] Preferably, apparatus 2 is configured in such a way that for side signals received with practically only noise, apparatus 2 switches uniformly to a stereo pseudo-operation (blind upmix), as illustrated in figures 9 and 10. This allows to transmit a pseudo stereo signal at the output of device 2, in case the FM receiver 1 has been switched to mono operation (due to the high level of noise caused by poor reception conditions), or in the case of the side signal part of the signal stereo input 1 is so noisy that reliable PS parameters cannot be estimated. [00098] For side signals with almost no noise, device 2 preferably switches uniformly to normal stereo operation rather than parametric stereo operation. In normal stereo operation, the signal enhancement functionality of device 2 is essentially disabled. For deactivation, the audio signal, at the input of the device, can essentially be fed by the output of device 2. [00099] Alternatively, normal stereo operation can be done by using the received side signal So, as illustrated in figures 4 and 6. For normal stereo operation, the received side signal So is used for mixing in upmix stage 4. proper selection of upmix parameters in upmix stage 4, output signal L', R' of upmix stage 4 corresponds to output signal L, R of FM transmitter 1; for example, when mixing the mono DM downmix and the received signal So, according to: in the case DM = M = (L + R)/2 and So = (L - R)/2. [000100] Particularly, normal stereo mode or parametric stereo mode can be selected in a frequency-varying way, ie the selection can be different for different frequency bands. This is useful as the signal-to-noise ratio for the received side signal gets worse at higher frequencies. [000101] Uniform switching between the different operating modes can be dynamically adapted to the current reception conditions, to always provide the best possible stereo signal at the output of the device 2. In case of high signal-to-noise ratio (no noise reduction based on PS processing) a normal FM stereo operation is preferred, while in the case of a low signal to noise ratio, PS processing greatly improves the stereo signal. [000102] Preferably, the generation of the DM mono downmix in the PS 7 encoder should be done in such a way that, as much as possible, the side signal noise leaks into the DM mono downmix. This may require different downmix techniques than those typically used in a PS encoder (such as an MPEG-4 PS to MPEG-4 encoder), which is typically employed in the context of a very low bit rate encoding system. This can be as simple as a fixed (non-adaptive) downmix DM = M = (L + R)/2, where the downmix simply corresponds to the intermediate signal. Furthermore, the upmix in the PS 8 decoder is typically adapted to the effective downmix technique used in the PS 7 encoder. [000103] It should be noted that although in several drawings the PS 7 encoder and PS 8 decoder are shown as separate modules, it is, of course, advantageous in the context of an efficient implementation, to unite the PS 7 encoder and the decoder both as possible. [000104] The concepts discussed in this specification can be implemented in conjunction with any encoder using PS techniques, eg a HE-AAC v2 (High-Efficiency Advanced Audio Coding version 2) encoder as defined in the ISO/IEC standard 14496-3 (MPEG-4 Audio), an encoder based on MPEG Surround, or an encoder based on MPEG USAC (Speech and Unified Audio encoder), as well as encoders that are not covered by the MPEG standards. [000105] Next, by way of example, an HE-AAC v2 encoder is considered; nevertheless, the concepts can be used in conjunction with any audio encoder using PS techniques. [000106] HE-AAC is a lossy audio compression scheme. HE-AAC v1 (HE-AAC version 1) makes use of a spectral band replica (SBR) to increase compression efficiency. The HE-AAC v2 also includes a parametric stereo system to increase the efficiency of compressing stereo signals at very low bit rates. A HE-AAC v2 encoder inherently includes a PS encoder to allow operation at very low bit rates. The PS encoder of this HE-AAC v2 encoder can be used as the PS encoder 7 of the audio processing device 2. In particular, the PS parameter estimation stage inside a PS encoder of a HE-AAC v2 encoder can be used as the PS parameter estimation stage 3 of the audio processing device 2. Also, the downmix stage within a PS encoder of a HE-AAC v2 encoder can be used as the downmix stage 9 of the device 2. [000107] Therefore, the concept discussed in this descriptive report can be efficiently combined with a HE-AAC v2 encoder to promote an improved FM stereo radio receiver. This enhanced FM stereo radio receiver can have a HE-AAC v2 recording feature, as the HE-AAC v2 encoder transmits a HE-AAC v2 bitstream that can be stored for recording purposes. This is shown in Figure 12. In this embodiment, the apparatus 2 comprises a HE-AAC v2 encoder 16 and the PS 8 decoder. The HE-AAC v2 encoder provides the PS 7 encoder, used to generate the mono DM downmix and PS parameters. 5 as discussed in conjunction with the previous drawings. [000108] Optionally, the PS 7 encoder can be modified for the purpose of FM radio noise reduction, to support a fixed downmix scheme, such as a downmix scheme according to DM = (L + R)/a . [000109] The DM mono downmix and PS 8 parameters can be fed to the PS 8 decoder to generate the stereo signal L', R' as discussed above. The DM mono downmix is fed to a HE-AAC v2 encoder for perceptual encoding of the DM mono downmix. The resulting perceptual encoded audio signal and PS information are multiplexed into a HE-AAC v2 18 bitstream. For recording purposes, the HE-AAC v2 18 bitstream can be stored in a memory, such as a flash memory or a hard drive. [000110] The HE-AAC v1 encoder comprises an SBR encoder and an AAC encoder (not shown). The SBR encoder typically performs signal processing in the QMF domain (quadrature mirror filterbank), and therefore needs QMF samples. In comparison, the AAC encoder typically requires time domain samples (typically it is reduced in sample rate by a factor of 2). [000111] The PS encoder 7 inside the HE-AAC v2 encoder 16 typically provides the DM downmix signal already in the QMF domain. [000112] Since the PS 7 encoder can already send the DM signal in the QMF domain to the HE-AAC v1 encoder, the transform of the QMF analysis in the HE-AAC v1 encoder for the SBR analysis can be made obsolete. In this way, QMF analysis, which is normally part of the HE-AAC v1 encoder, can be avoided by providing the DM downmix signal as QMF samples. This reduces the computing effort and allows for simplification. [000113] The time domain samples for the AAC encoder can be derived from the input of apparatus 2, for example, by performing the simple operation DM = (L + R)/2 in the time domain and by reducing the sampling rate of the DM signal in the time domain. This approach is probably the cheapest approach. Alternatively, apparatus 2 can perform a half-rate QMF synthesis of the DM samples in the QMF domain. [000114] It should be noted that the PS encoder and PS decoder can be partially joined, if both are implemented in the same module.
权利要求:
Claims (34) [0001] 1. Apparatus for enhancing a left/right or intermediate/side audio signal emitted by an FM stereo radio receiver, the apparatus characterized in that it comprises: - an input stage configured to receive the left/right audio signal or mid/side from FM stereo radio receiver; - a downmix stage, the downmix stage configured to generate a first audio signal based on the left/right or intermediate/side audio signal by a downmix operation; - a parametric stereo parameter estimation stage, the parameter estimation stage configured to determine one or more parametric stereo parameters based on the left/right or intermediate/side audio signal in a frequency-variant or frequency-invariant manner; and - a stereo mixing module, the stereo mixing module configured to generate a stereo signal based on the first audio signal and the one or more parametric stereo parameters; in which the downmix stage, the parametric stereo parameter estimation stage and the stereo mix module are implemented in the same module. [0002] 2. Apparatus according to claim 1, characterized in that - the apparatus further comprises a decorrelator configured to generate a decorrelation signal based on the first audio signal; and - the stereo mixing module is configured to generate the stereo signal based - on the first audio signal, - on the one or more parametric stereo parameters, and - on the decorrelated signal or at least a frequency band thereof. [0003] 3. Apparatus according to claim 1 or 2, characterized in that the downmix stage is configured to generate the first audio signal according to the following formula: (L + R)/a, where L and R denote the left and right channels of the left/right audio signal and a is a real number. [0004] 4. Apparatus, according to any one of claims 1 to 3, characterized in that the first signal corresponds to an intermediate signal received. [0005] 5. Apparatus according to claim 1, characterized in that the stereo mixing module is configured to generate the stereo signal based - on the first audio signal, - on one or more parametric stereo parameters, and - on a second audio signal or at least a frequency band thereof, with the second audio signal being a received side signal or a residual signal, the residual signal indicating an error associated with left/right or intermediate/side audio signal representation by the first audio signal and the one or more parametric stereo parameters. [0006] 6. Apparatus according to claim 5, characterized in that the downmix stage is further configured to derive the second audio signal based on the left/right audio signal. [0007] 7. Apparatus according to claim 5, characterized in that - the apparatus further comprises a de-correlator receiving the first audio signal and emitting a de-correlated signal, and - the stereo mixing module generates the stereo signal selectively based on - in the second audio signal, or - in the uncorrelated signal, with the selection being frequency-invariant or frequency-variant. [0008] 8. Apparatus, according to claim 7, characterized by the fact that the selection is often variant. [0009] 9. Apparatus according to claim 8, characterized in that the stereo mixing module uses - the second audio signal for a first frequency band and - the uncorrelated signal for a second frequency band, with the frequencies of the first frequency band being lower than the frequencies of the second frequency band. [0010] 10. Apparatus according to claim 7, characterized in that the selection depends on - a radio reception indicator indicative of the radio reception condition, and/or - on a quality indicator indicative of the quality of the side signal Received. [0011] 11. Apparatus according to any one of claims 1 to 10, characterized in that the one or more parametric stereo parameters include a parameter indicating a channel level difference and/or a parameter indicating a cross-correlation between channels. [0012] 12. Apparatus according to any one of claims 1 to 11, characterized in that - the apparatus further comprises a noise reduction stage, the noise reduction stage for noise reduction of the first audio signal, and - the first reduced noise audio signal after noise reduction is fed to the stereo mix module to generate the stereo signal based on the first reduced noise audio signal and the one or more parametric stereo parameters. [0013] 13. Apparatus according to any one of claims 1 to 11, characterized in that - the apparatus further comprises a noise reduction stage for noise reduction of the left/right or intermediate/side audio signal, and - the A reduced noise left/right or mid/side audio signal after noise reduction is fed to the parametric stereo parameter estimation stage to generate the one or more parametric stereo parameters. [0014] 14. Apparatus according to claim 13, characterized in that - the first audio signal is obtained from the left/right or intermediate/side audio signal upstream of the noise reduction stage. [0015] 15. Apparatus according to any one of claims 1 to 11, characterized in that - the apparatus further comprises a noise estimation stage, the noise estimation stage configured to determine a noise parameter characteristic for the power noise of the received side signal; and - the parametric stereo parameter estimation stage is configured to determine the one or more parametric stereo parameters based on the left/right or middle/side audio signal and the noise parameter in a frequency-variant or frequency-invariant manner. [0016] 16. Apparatus according to any one of claims 1 to 15, characterized in that - the apparatus is configured to detect that the FM stereo receiver selects mono output of the stereo radio signal or the apparatus is configured to detect reception of poor radio; and - the stereo mix module uses one or more upmix parameters for blind upmix in case the device detects that the FM stereo receiver selects mono output of the stereo radio signal or the device detects poor reception. [0017] 17. Apparatus according to claim 16, characterized in that the one or more upmix parameters for blind upmix are one or more preset upmix parameters. [0018] 18. Apparatus according to claim 16, characterized in that - the apparatus further comprises a speech detector, the speech detector indicating whether the left/right or intermediate/lateral audio signal is predominantly speech, and - the one or more upmix parameters for blind upmix are dependent on the speech detector indication. [0019] 19. Apparatus according to any one of claims 1 to 15, characterized in that - the apparatus is configured to detect that the FM stereo receiver selects mono output of the stereo radio signal or the apparatus is configured to detect reception of poor radio; and - when the FM stereo receiver switches to mono output or poor radio reception occurs, the stereo mix module uses one or more upmix parameters that are based on one or more stereo parametric parameters estimated previously from the stereo parameter estimation stage parametric parameters. [0020] 20. Apparatus according to claim 19, characterized in that the stereo upmix stage continues to use the one or more stereo parametric parameters estimated previously from the parametric stereo parameter estimation stage as upmix parameters when the FM stereo receiver switches to mono output or poor radio reception occurs. [0021] 21. Apparatus according to any one of claims 1 to 15, characterized in that - the apparatus is configured to detect good radio reception on the FM stereo radio receiver; - the input stage is configured to receive the left/right audio signal from the FM stereo radio receiver; - when the device detects good radio reception, the device selects normal stereo mode; and - in a normal stereo mode, the stereo signal corresponds to the left/right audio signal. [0022] 22. Apparatus according to any one of claims 1 to 21, characterized in that the apparatus is operable to select normal stereo mode in a frequency variant mode. [0023] 23. Apparatus according to any one of claims 1 to 22, characterized in that the apparatus comprises: - a stereo parametric encoder having the stage of estimating parametric stereo parameters; and - a parametric stereo decoder having the stereo mix module. [0024] 24. Apparatus according to any one of claims 1 to 22, characterized in that the apparatus comprises an audio encoder supporting parametric stereo, the audio encoder comprising a parametric stereo encoder, with parametric stereo parameter estimation stage being part of the parametric stereo encoder. [0025] 25. Apparatus according to claim 24, characterized in that the audio encoder is a HE-AAC v2 audio encoder. [0026] 26. Apparatus according to claim 24, characterized in that the audio encoder emits a stream of audio bits. [0027] 27. Apparatus according to claim 25, characterized in that the HE-AAC v2 audio encoder emits a HE-AAC v2 bit stream. [0028] 28. Apparatus according to claim 26, characterized in that - the HE-AAC v2 encoder comprises, downstream of the parametric stereo encoder, a HE-AAC v1 encoder, - the first audio signal is a signal in the domain QMF and the first audio signal is transported to the HE-AAC v1 encoder, and - the HE-AAC v1 encoder does not perform QMF analysis of the first audio signal. [0029] 29. Apparatus according to claim 1, characterized in that it further comprises: a first noise reduction stage configured to reduce noise in the left/right or intermediate/side audio signal being inserted into the parameter estimation stage parametric stereos; a second noise reduction stage configured to reduce noise in the first audio signal being input to the stereo mix module; where the first noise reduction stage is configured to perform greater noise reduction than the second noise reduction stage. [0030] 30. FM stereo radio receiver characterized in that it is configured to receive an FM radio signal comprising an intermediate signal and a side signal, and having an apparatus as defined in any one of claims 1 to 29. [0031] 31. Mobile communication device characterized in that it comprises: - an FM stereo receiver configured to receive an FM radio signal comprising an intermediate signal and a side signal; and - an apparatus as defined in any one of claims 1 to 29. [0032] 32. Method for enhancing a left/right or middle/side audio signal from an FM stereo radio receiver, the FM stereo radio receiver configured to receive an FM radio signal, the method characterized in that it comprises the steps of: - receiving left/right or middle/side audio signal from FM stereo radio receiver; - generate a first audio signal based on the left/right or intermediate/side audio signal by a downmix operation; - determine one or more parametric stereo parameters based on the left/right or intermediate/side audio signal in a frequency-variant or frequency-invariant manner; and - generating a stereo signal based on the first audio signal and the one or more parametric stereo parameters by an upmix operation in which the steps of generating a first audio signal, determining and generating a stereo signal are conducted in a same module. [0033] 33. Method according to claim 32, characterized in that the method further comprises the step of: - generating an uncorrelated signal based on the first audio signal, and the stereo signal is generated by the upmix operation, based on on the first audio signal, on the uncorrelated signal, and on the one or more parametric stereo parameters. [0034] 34. Method according to claim 32, characterized in that the method further comprises: reducing noise in the left/right or intermediate/side audio signal before the step of determining one or more parametric stereo parameters; reduce noise in the first audio signal before the step of generating the stereo signal; where noise reduction on the left/right or middle/side audio signal effects a greater noise reduction than noise reduction on the first audio signal.
类似技术:
公开号 | 公开日 | 专利标题 BR112012005534B1|2021-08-17|DEVICE, FM STEREO RADIO RECEIVER, MOBILE COMMUNICATION DEVICE, AND METHOD TO IMPROVE A LEFT/RIGHT OR INTERMEDIATE/SIDE AUDIO SIGNAL TWI516138B|2016-01-01|System and method of determining a parametric stereo parameter from a two-channel audio signal and computer program product thereof US20200388293A1|2020-12-10|Multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a residual-signal-based adjustment of a contribution of a decorrelated signal ES2526320T3|2015-01-09|Hiding intermittent mono reception of FM stereo radio receivers KR101585852B1|2016-01-15|High quality detection in fm stereo radio signals JP2008519306A|2008-06-05|Encode and decode signal pairs
同族专利:
公开号 | 公开日 US9877132B2|2018-01-23| JP2014017829A|2014-01-30| JP2013504908A|2013-02-07| EP3035712B1|2017-11-08| EP2476269B1|2016-03-16| EP3035712A1|2016-06-22| US20120207307A1|2012-08-16| HK1168219A1|2012-12-21| HK1220067A1|2017-04-21| ES2571707T3|2016-05-26| TWI433137B|2014-04-01| JP5393892B2|2014-01-22| CN102598717B|2014-12-17| US8929558B2|2015-01-06| EP2476269A1|2012-07-18| ES2655972T3|2018-02-22| WO2011029570A1|2011-03-17| TW201137856A|2011-11-01| JP5635662B2|2014-12-03| US20150086022A1|2015-03-26| CN102598717A|2012-07-18| BR112012005534A2|2021-03-30| RU2491763C1|2013-08-27| WO2011029570A8|2011-05-05|
引用文献:
公开号 | 申请日 | 公开日 | 申请人 | 专利标题 DE244666C|1911-03-11|1912-03-15|Stanzwerk Oberscheden Juenemann & Co|Method and device for sterilizing milk| US3823268A|1972-06-07|1974-07-09|Mc Intosh Labor Inc|Dynamic stereo separation control| JPS6340379B2|1980-08-07|1988-08-10|Clarion Co Ltd| DE3048263A1|1980-12-20|1982-07-29|Blaupunkt-Werke Gmbh, 3200 Hildesheim|BROADCAST RECEIVER| US4390749A|1981-04-13|1983-06-28|Superscope, Inc.|Noise control system for FM radio| JPS5952937A|1982-09-20|1984-03-27|Pioneer Electronic Corp|Multiplex transmitting system| US4485483A|1983-03-18|1984-11-27|Torick Emil L|FM Stereophonic system incorporating companding of difference signal| US4496979A|1983-11-22|1985-01-29|Casat Technology, Inc.|FM High-fidelity processor| US4602380A|1985-01-04|1986-07-22|Cbs Inc.|Compatible transmission techniques for FM stereophonic radio and television| AU578792B2|1985-01-04|1988-11-03|Btp International|Broadcast stereo companding system and apparatus| JPS61242133A|1985-04-19|1986-10-28|Hitachi Ltd|Radio receiver| DD244666A1|1985-12-19|1987-04-08|Halbleiterwerk Veb|PROCESS FOR NOISE REDUCTION| JPH0347016B2|1986-01-25|1991-07-18|Fujitsu Ten Ltd| JPS63194437A|1987-02-09|1988-08-11|Alpine Electron Inc|Quasi stereo circuit| US4833715A|1987-03-06|1989-05-23|Alps Electric Co., Ltd.|FM stereo receiver| JPS6472636A|1987-09-14|1989-03-17|Mitsubishi Electric Corp|Multi-path noise reduction circuit| SU1601758A1|1988-12-20|1990-10-23|Уральский политехнический институт им.С.М.Кирова|Am/fm transceiving system| WO1991008624A1|1989-11-30|1991-06-13|Motorola, Inc.|Method and apparatus for random fm noise cancellation| JPH03259624A|1990-03-09|1991-11-19|Pioneer Electron Corp|Noise reduction circuit| US5249233A|1992-04-06|1993-09-28|Ford Motor Company|Multipath noise minimizer for radio receiver| JPH06291692A|1993-03-31|1994-10-18|Victor Co Of Japan Ltd|Sound field and sound quality controller for fm radio receiver| JPH0846585A|1994-07-27|1996-02-16|Fujitsu Ten Ltd|Stereophonic reception device| JP3259624B2|1996-01-31|2002-02-25|日産自動車株式会社|Surface condition inspection device| JPH1072636A|1996-08-30|1998-03-17|Honda Motor Co Ltd|Magnetostrictive member and its production| US6178316B1|1997-04-29|2001-01-23|Meta-C Corporation|Radio frequency modulation employing a periodic transformation system| DE19808818A1|1998-03-03|1999-09-09|Grundig Ag|Radio receiver for a vehicle| US6539357B1|1999-04-29|2003-03-25|Agere Systems Inc.|Technique for parametric coding of a signal containing information| JP2000332710A|1999-05-24|2000-11-30|Sanyo Electric Co Ltd|Receiver for stereophonic broadcast| DE60014790T2|1999-07-15|2006-02-09|Mitsubishi Denki K.K.|Device for reducing noise| JP3473511B2|1999-07-22|2003-12-08|三菱電機株式会社|Multipath noise elimination device, audio output device, and FM receiver| JP3368879B2|1999-12-22|2003-01-20|三菱電機株式会社|Multipath noise elimination device, audio output device, and FM receiver| EP1206043B1|2000-11-08|2009-12-23|Sony Deutschland GmbH|Noise reduction in a stereo receiver| SE0202159D0|2001-07-10|2002-07-09|Coding Technologies Sweden Ab|Efficientand scalable parametric stereo coding for low bitrate applications| US8605911B2|2001-07-10|2013-12-10|Dolby International Ab|Efficient and scalable parametric stereo coding for low bitrate audio coding applications| JP4151243B2|2001-07-26|2008-09-17|三菱電機株式会社|Multipath noise removal method and removal apparatus, FM receiver| US20030087618A1|2001-11-08|2003-05-08|Junsong Li|Digital FM stereo decoder and method of operation| DE10202635B4|2002-01-24|2006-05-24|Harman/Becker Automotive Systems Gmbh|Method and circuit for noise reduction| DE10202639A1|2002-01-24|2003-08-21|Harman Becker Automotive Sys|Method for reducing noise in a stereo radio receiver and stereo radio receiver| JP3963747B2|2002-03-20|2007-08-22|三洋電機株式会社|Signal processing apparatus, signal receiving apparatus, and signal processing method| RU42145U1|2002-03-21|2004-11-20|Гладков Борис Васильевич|MULTI-STAGE ELECTRO-ACOUSTIC RADIATOR| BR0304540A|2002-04-22|2004-07-20|Koninkl Philips Electronics Nv|Methods for encoding an audio signal, and for decoding an encoded audio signal, encoder for encoding an audio signal, apparatus for providing an audio signal, encoded audio signal, storage medium, and decoder for decoding an audio signal. encoded audio| TWI301361B|2002-04-26|2008-09-21|Niigata Seimitsu Co Ltd| ES2294300T3|2002-07-12|2008-04-01|Koninklijke Philips Electronics N.V.|AUDIO CODING| AU2003281128A1|2002-07-16|2004-02-02|Koninklijke Philips Electronics N.V.|Audio coding| EP1595247B1|2003-02-11|2006-09-13|Koninklijke Philips Electronics N.V.|Audio coding| GB0304126D0|2003-02-24|2003-03-26|1 Ltd|Sound beam loudspeaker system| WO2004077690A2|2003-02-26|2004-09-10|Koninklijke Philips Electronics N.V.|Noise cancellation system in an analog fm receiver| WO2004086817A2|2003-03-24|2004-10-07|Koninklijke Philips Electronics N.V.|Coding of main and side signal representing a multichannel signal| US7835916B2|2003-12-19|2010-11-16|Telefonaktiebolaget Lm Ericsson |Channel signal concealment in multi-channel audio systems| US7583805B2|2004-02-12|2009-09-01|Agere Systems Inc.|Late reverberation-based synthesis of auditory scenes| US7391870B2|2004-07-09|2008-06-24|Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E V|Apparatus and method for generating a multi-channel output signal| US7720230B2|2004-10-20|2010-05-18|Agere Systems, Inc.|Individual channel shaping for BCC schemes and the like| DE202004016975U1|2004-11-01|2004-12-30|Autoliv Development Ab|Airbag device with a destructible connecting section between a gas generator and a non-metallic gas lance| SE0402650D0|2004-11-02|2004-11-02|Coding Tech Ab|Improved parametric stereo compatible coding or spatial audio| EP1851866B1|2005-02-23|2011-08-17|Telefonaktiebolaget LM Ericsson |Adaptive bit allocation for multi-channel audio encoding| SG161227A1|2005-04-04|2010-05-27|That Corp|Signal quality estimation and control system| US7751572B2|2005-04-15|2010-07-06|Dolby International Ab|Adaptive residual audio coding| JP2006303799A|2005-04-19|2006-11-02|Mitsubishi Electric Corp|Audio signal regeneration apparatus| JP4542482B2|2005-08-26|2010-09-15|株式会社ケンウッド|Residual noise reduction circuit| JP2007129511A|2005-11-04|2007-05-24|Sony Corp|Sound output apparatus and method, program, and recording medium| KR101358700B1|2006-02-21|2014-02-07|코닌클리케 필립스 엔.브이.|Audio encoding and decoding| US7965848B2|2006-03-29|2011-06-21|Dolby International Ab|Reduced number of channels decoding| JP2007274061A|2006-03-30|2007-10-18|Yamaha Corp|Sound image localizer and av system| JP5513887B2|2006-09-14|2014-06-04|コーニンクレッカフィリップスエヌヴェ|Sweet spot operation for multi-channel signals| JP4930320B2|2006-11-30|2012-05-16|ソニー株式会社|Reproduction method and apparatus, program, and recording medium| JP2009010841A|2007-06-29|2009-01-15|Kenwood Corp|Stereophonic demodulation device and its method| KR101513028B1|2007-07-02|2015-04-17|엘지전자 주식회사|broadcasting receiver and method of processing broadcast signal| US20090164223A1|2007-12-19|2009-06-25|Dts, Inc.|Lossless multi-channel audio codec| KR101756834B1|2008-07-14|2017-07-12|삼성전자주식회사|Method and apparatus for encoding and decoding of speech and audio signal| KR101367604B1|2009-03-17|2014-02-26|돌비 인터네셔널 에이비|Advanced stereo coding based on a combination of adaptively selectable left/right or mid/side stereo coding and of parametric stereo coding| CN103718466B|2011-08-04|2016-08-17|杜比国际公司|By using parametric stereo to improve FM stereo radio electricity receptor|JPS6169751A|1984-09-13|1986-04-10|Nippon Tokushu Noyaku Seizo Kk|Salicylamide derivative, its preparation and agent for controlling crubroot of cruficerous plant| RU2012106341A|2009-07-22|2013-08-27|Стормингсвисс Гмбх|DEVICE AND METHOD FOR OPTIMIZING STEREOPHONIC OR PSEUDOSTEREOFONIC AUDIO SIGNALS| JP5581449B2|2010-08-24|2014-08-27|ドルビー・インターナショナル・アーベー|Concealment of intermittent mono reception of FM stereo radio receiver| TWI516138B|2010-08-24|2016-01-01|杜比國際公司|System and method of determining a parametric stereo parameter from a two-channel audio signal and computer program product thereof| EP2477188A1|2011-01-18|2012-07-18|Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.|Encoding and decoding of slot positions of events in an audio signal frame| CN103718466B|2011-08-04|2016-08-17|杜比国际公司|By using parametric stereo to improve FM stereo radio electricity receptor| KR101616700B1|2011-09-29|2016-05-11|돌비 인터네셔널 에이비|Prediction-based fm stereo radio noise reduction| CN103918030B|2011-09-29|2016-08-17|杜比国际公司|High quality detection in the FM stereo radio signal of telecommunication| US9178553B2|2012-01-31|2015-11-03|Broadcom Corporation|Systems and methods for enhancing audio quality of FM receivers| US9130643B2|2012-01-31|2015-09-08|Broadcom Corporation|Systems and methods for enhancing audio quality of FM receivers| US9601122B2|2012-06-14|2017-03-21|Dolby International Ab|Smooth configuration switching for multichannel audio| TR201906190T4|2013-01-29|2019-05-21|Fraunhofer Ges Forschung|The decoder for generating a frequency-enhanced audio signal, the method for decoding, the encoder for generating an encoded signal, and the method for encoding the compact selection side information.| EP2790419A1|2013-04-12|2014-10-15|Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.|Apparatus and method for center signal scaling and stereophonic enhancement based on a signal-to-downmix ratio| EP2830053A1|2013-07-22|2015-01-28|Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.|Multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a residual-signal-based adjustment of a contribution of a decorrelated signal| AU2014295207B2|2013-07-22|2017-02-02|Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V.|Multi-channel audio decoder, multi-channel audio encoder, methods, computer program and encoded audio representation using a decorrelation of rendered audio signals| EP2830334A1|2013-07-22|2015-01-28|Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.|Multi-channel audio decoder, multi-channel audio encoder, methods, computer program and encoded audio representation using a decorrelation of rendered audio signals| US10141004B2|2013-08-28|2018-11-27|Dolby Laboratories Licensing Corporation|Hybrid waveform-coded and parametric-coded speech enhancement| TWI634547B|2013-09-12|2018-09-01|瑞典商杜比國際公司|Decoding method, decoding device, encoding method, and encoding device in multichannel audio system comprising at least four audio channels, and computer program product comprising computer-readable medium| ES2641538T3|2013-09-12|2017-11-10|Dolby International Ab|Multichannel audio content encoding| US9143087B2|2013-11-19|2015-09-22|Qualcomm Incorporated|Adaptive FM demodulator supporting multiple modes| US9866986B2|2014-01-24|2018-01-09|Sony Corporation|Audio speaker system with virtual music performance| US9826332B2|2016-02-09|2017-11-21|Sony Corporation|Centralized wireless speaker system| US9924291B2|2016-02-16|2018-03-20|Sony Corporation|Distributed wireless speaker system| US11234072B2|2016-02-18|2022-01-25|Dolby Laboratories Licensing Corporation|Processing of microphone signals for spatial playback| US9826330B2|2016-03-14|2017-11-21|Sony Corporation|Gimbal-mounted linear ultrasonic speaker assembly| US9794724B1|2016-07-20|2017-10-17|Sony Corporation|Ultrasonic speaker assembly using variable carrier frequency to establish third dimension sound locating| US10210881B2|2016-09-16|2019-02-19|Nokia Technologies Oy|Protected extended playback mode| US9854362B1|2016-10-20|2017-12-26|Sony Corporation|Networked speaker system with LED-based wireless communication and object detection| US9924286B1|2016-10-20|2018-03-20|Sony Corporation|Networked speaker system with LED-based wireless communication and personal identifier| US10075791B2|2016-10-20|2018-09-11|Sony Corporation|Networked speaker system with LED-based wireless communication and room mapping|
法律状态:
2021-04-13| B15K| Others concerning applications: alteration of classification|Free format text: AS CLASSIFICACOES ANTERIORES ERAM: H04S 1/00 , H04S 5/00 , H04H 40/45 , H04B 1/16 , H04H 40/81 Ipc: H04H 40/72 (2008.01), H04H 40/81 (2008.01), ¢...! | 2021-04-13| B06F| Objections, documents and/or translations needed after an examination request according [chapter 6.6 patent gazette]| 2021-04-13| B06U| Preliminary requirement: requests with searches performed by other patent offices: procedure suspended [chapter 6.21 patent gazette]| 2021-07-06| B09A| Decision: intention to grant [chapter 9.1 patent gazette]| 2021-08-17| B16A| Patent or certificate of addition of invention granted [chapter 16.1 patent gazette]|Free format text: PRAZO DE VALIDADE: 20 (VINTE) ANOS CONTADOS A PARTIR DE 07/09/2010, OBSERVADAS AS CONDICOES LEGAIS. PATENTE CONCEDIDA CONFORME ADI 5.529/DF, QUE DETERMINA A ALTERACAO DO PRAZO DE CONCESSAO. |
优先权:
[返回顶部]
申请号 | 申请日 | 专利标题 US24111309P| true| 2009-09-10|2009-09-10| US61/241,113|2009-09-10| PCT/EP2010/005481|WO2011029570A1|2009-09-10|2010-09-07|Improvement of an audio signal of an fm stereo radio receiver by using parametric stereo| 相关专利
Sulfonates, polymers, resist compositions and patterning process
Washing machine
Washing machine
Device for fixture finishing and tension adjusting of membrane
Structure for Equipping Band in a Plane Cathode Ray Tube
Process for preparation of 7 alpha-carboxyl 9, 11-epoxy steroids and intermediates useful therein an
国家/地区
|